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Abstract: The natural Hilbert Space of quantum particles can imple- 
ment maximum- likelihood (ML) decoding of classical information. The 'Quan- 
tum Product Algorithm' (QPA) is computed on a Factor Graph, where function 
nodes are unitary matrix operations followed by appropriate quantum measure- 
ment. QPA is like the Sum-Product Algorithm (SPA), but without summary, 
giving optimal decode with exponentially finer detail than achievable using SPA. 
Graph cycles have no effect on QPA performance. QPA must be repeated a 
number of times before successful and the ML codeword is obtained only after 
repeated quantum 'experiments'. ML amplification improves decoding accuracy, 
and Distributed QPA facilitates successful evolution. 

Keywords: Factor Graphs, Quantum Computation, Quantum Algorithms, 
Sum Product Algorithm, Graph Algorithms 
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1 Introduction 



Recent interest in Turbo Codes M and Low Density Parity Check Codes |6[ 
has fuelled development of Factor Graphs and associated Sum-Product Algo- 
rithm 0] (SPA), with applications to error-correction, signal processing, 
statistics, neural networks, and system theory. Meanwhile the possibility of 



Quantum Computing has sparked much interest |j, 10], and Quantum Bayesian 



Nets have been proposed to help analyse and design Quantum Computers 



12, 11 1 . This paper links these areas of research, showing that quantum re- 
sources can achieve maximum-likelihood (ML) decoding of classical information. 
The natural Hilbert Space of a quantum particle encodes a probability vector, 
and the joint-state of quantum particles realises the 'products' associated with 
SPA. SPA summary is omitted as quantum bits (qubits) naturally encode the 
total joint-probability state. Dependencies between vector indices become 'en- 
tanglement' in quantum space, with the Factor Graph defining dependency 
(entanglement) between qubits. Graph function nodes are implemented as uni- 
tary matrix -vector products followed by quantum measurement. This is the 
Quantum Product Algorithm (QPA). As QPA avoids summary it avoids prob- 
lems encountered by SPA on graphs with short cycles. Moreover, whereas SPA 
is iterative, using message-passing and activating each node more than once, 
QPA does not iterate but must successfully activate each node only once. How- 
ever the (severe) drawbacks with QPA are as follows: 1) Each function node 
must be repeatedly activated until it successfully 'prepares' it's local variable 
nodes (qubits) in the correct entangled state - any activation failure destroys 
evolution in all variable nodes already entangled with local variables. 2) Once 
a complete Factor Graph has successfully evolved, final quantum measurement 
only delivers the ML codeword with a certain (largest) probability. Repeated 
successful evolutions then determine the ML codeword to within any degree of 
confidence. This second drawback can be overcome by suitable "ML Amplifi- 
cation" of QPA output, prior to measurement. 



'Unitary' means that U satisfies UU^ = I, where | means 'conjugate transpose'. 
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Section |2| presents QPA, highlighting its ability to deliver the optimal out- 
put joint-state, unlike SPA. Quantum systems describe the exact joint-state by 
appropriate 'entanglement' with and measurement of ancillary qubits. Section 
U considers a simple example of QPA on Quantum Factor Graphs, showing 
that iteration on graphs with cycles is unnecessary because QPA avoids pre- 
mature summary. Section |^ shows how to amplify the likelihood of measuring 
the ML codeword from QPA output. Unfortunately QPA must be repeated 
many times and/or executed in parallel to have a hope of successful comple- 
tion. Suitable distributed QPA scheduling is discussed in Section [|, and it is 
argued that successful QPA completion is conceivable using asynchronous dis- 
tributed processing on many-node Factor Graphs. This paper does not deal 
with phase properties of quantum computers. It is expected that the inclusion 
of phase and non-diagonal unitary matrices will greatly increase functionality 
of the Quantum Factor Graph. 

The aim of this paper is not to propose an immediately realisable imple- 
mentation of a quantum computer. Rather, it is to highlight similarities be- 
tween graphs for classical message-passing, and graphs that 'factor' quantum 
computation. The paper also highlights the differences between the two graphs: 
whereas classical graphs can only ever compute over a tensor product space, the 
quantum graph can compute over the complete entangled (tensor-irreducible) 
space. 

2 The Quantum Product Algorithm (QPA) 

2.1 Preliminaries 

Consider the Factor Graph of Fig HI. 




Figure 1: Two-Qubit Factor Graph 
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, where \g k \ 2 = 1 - |/ fc | 2 , 
and flg k + / fc $ = 0, Vfc. 
'*' means complex conjugate. 



Uf g is unitary, and the Uf of Fig [l] and subsequent 



figures always implies the action of Uf g together with the measurement of an 
ancillary qubit, z, as described below. A qubit, be in states or 1 

or in a statistical superposition of and 1. Let qubits xq,x\ be initialised 
(by the black boxes) to states xo = («o> Ai) T and x\ = (ai,/?i) T , where a{,Pi 
are complex probabilities such that \ai\ 2 + \(3i\ 2 = 1. For instance, xq is in 
states and 1 with probabilities |«o | 2 and |/?o| 2 ) respectively. Let an ancil- 
lary qubit, z, be initialised to state 0, i.e. z = (1,0). Then the initial joint 
probability product-state of qubits xo,x\,z is A = (ao,A)) T (8> (ati,(3x) T ® 
(1,0) T = (a ai, p ai,a Pi, P f3i,0, 0,0, 0) T = (s , si, s 2 , s 3 , 0, 0, 0, 0) T , where 
| so 1 2 + | s 1 1 2 + | S2 1 2 + | S3 1 2 = 1, and '(8)' is the tensor product. The element at 
vector index v is the probability that the qubits are in state v. For instance, 
qubits xqX\z are in joint-state 010 with probability | S2 1 2 - Subsequent measure- 
ment of a subset of the qubits projects the measured qubits to a fixed substate 
with a certain probability, and 'summarises' the vector for the remaining non- 
measured qubits. Thus QPA is as follows, 



• Compute S = Uf g A. 

• Measure qubit z. With probability pj = |so/o| 2 + l s i/i| 2 + 1 s 2 y 2 1 2 + IS3/3I 2 
we collapse z to 0, and xq, x\ to joint-state Sf = /xo(so/o; s\f\, S2/2, szfz) T ■ 
With probability p g = |so9o| 2 + |si<?i| 2 + | S2<?2 1 2 + I -5393 1 2 we collapse z 
to 1, and xq,x\ to joint-state S g = ^i(sogo, S191, S292, . p>o and /ii 
are normalisation constants, pj +p g = 1. Sf is our desired QPA result. 
Successful QPA completion is self-verified when we measure z = 0. 

In contrast, classical SPA computes Sf = Uf A (with probability 1) and must 
then perform a subsequent 'summary' step on Sf before returning a result for 
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each variable separately. This result is, 

X = |^ | 2 (|S0/0| 2 + |S2/2| 2 , |si/l| 2 + |s3/3| 2 ) T , X\ = \ /JLl | 2 (| S /o 1 2 + | | 2 , |«2/2| 2 + 
l«3/ 3 | 2 ) T . 

For instance, for xq = we sum the two classical [] probabilities of Sf 
where xq = to get |so/o| 2 + |s2./2| 2 - Similarly, for xo = 1 we summarise 
to |si/i| 2 + IS3/3I 2 . It is in this sense that SPA is a 'tensor-approximation' of 
QPA. 

We identify the following successively accurate computational scenarios (de- 
coding modes) for a space of N binary-state variables: 

• Hard-Decision operates on a probability space, 

(a , A)) ® (ai,/?i) ® . .. ® (a N -i,(3 N -i), a,/3 G {0, 1} 

• Soft-Decision operates on a probability space, 

(ao, A)) <S> (ai, <8>. . .<8> (ajv-i; Pn-i), a,(3 £ {Real Numbers 0^1} 

• Quantum Soft-Decision operates on a probability space, 

(ao, /?o) ® <8> . . . ® (ajv-i, Pn-i)i a,/3e {Complex Numbers} 

• Entangled-Decision operates on a probability space, 
(sq, si, «2) • • • j s 2 iv_ 1 ), s G {Complex Numbers} 

All four of the above Decision modes satisfies the probability restriction that 
the sum of the magnitude-squareds of the vector elements is 1. Both Quantum 
Soft-Decision and Entangled-Decision make use of the natural quantum statis- 
tical properties of matter, including the property of Superposition. Moreover, 
Entangled-Decision operates over exponentially larger space. Classical SPA op- 
erates in Soft-Decision mode. QPA operates in Entangled-Decision mode. In 
the previous discussion it was assumed that the QPA was operating on input of 
the form, (ao, A)) T ® («i> Al) T <S> (1,0) t . More generally, QPA can operate on 
input and deliver output in Entangled-Decision mode. This is in strong contrast 
to SPA which must summarise both input and output down to Soft-Decision 

3 Classical SPA probabilities in this paper are always represented as the magnitude-squared 
of their quantum counterparts 
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mode. It is this approximation that forces SPA to iterate and to sometimes fail 
on graphs with cycles. 

Consider the following example. If the diagonal of Uf is (1, 0, 0, 1), then Uf 
represents XOR, and Fig |] decodes to codeset C = {00, 11} (i.e. xq + x\ = 
0, mod 2). C has distance 2, which is optimal for length 2 binary codes: in 
general if Uf cannot be tensor-decomposed then it represents a code C with 
good distance properties. Initially, let xq = (V0.4, V0.6) T , x\ = (V0.6, \/0.4) T . 
Then A = (VM, 0.6, 0.4, ^024, 0, 0, 0, 0) r , and S f = ^(1,0, 0, 1) T . p f = 0.48, 
so, on average, 48 Sf outputs are computed for every 100 QPA attempts. The 
ML codeword is both 00 and 11, and when Sf is measured, 00 and 11 are 
equally likely to be returned. In contrast, classical SPA for the same input 
returns xq = x\ = (|, 5), implying (wrongly) an equally likely decode to any 
of the words 00,01, 10, 11. So even in this simplest example the advantage of 
QPA over SPA is evident. 

2.2 Product Space for Classical SPA 

Because xq and x\ are separated in Fig |], their classical joint-state only repre- 
sents tensor product states (Soft-Decision mode). An equivalent Factor Graph 
to that of Fig [I] could combine xq and x\ into one quaternary variable which 
would reach all non-product quaternary states. But this requires 'thickening' 
of graph communication lines and exponential increase in SPA computational 
complexity. Consequently only limited variable 'clustering' is desirable, al- 
though too little clustering 'thins out' the solution space to an insufficient 
highly-factored product space. This is the fundamental Factor Graph trade- 
off - good Factor Graphs achieve efficient SPA by careful variable 'separation', 
ensuring the joint product space is close enough to the exact (non-summarised) 
non-product space. 

2.3 Entangled Space for QPA 

In contrast, although xq and x\ are physically separated in Fig |l|, quantum non- 
locality must take into account correlations between xq and x\. Their joint-state 
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now occurs over the union of product and (much larger) non-product (entan- 
gled) space (Entangled-Decision mode). An entangled joint-state vector cannot 
be tensor- factorised over constituent qubits. QPA does not usually output to 
product space because the joint-state of output qubits is usually entangled. In 
fact QPA is algorithmically simpler than SPA, as SPA is a subsequent tensor 
approximation of QPA output at each local function. 

2.4 Example 

Let the diagonal of Uf be (1,0,0,1). Initialise xq and x\ to joint-product- 
state, x = -^=(1,\/2) T , xi = -^(1,1) T . With probability p f = 0.5 QPA 
measures z = and computes the joint-state of x , X\ as Sf = 0, 0, V2) T . 

A final measurement of qubits xo and x\ yields codewords 11 and 00 with 
probability |, and |, respectively. In contrast SPA summarises Sf to xq = x\ = 
2). Although a final 'hard-decision' on xq and x\ chooses, correctly, the ML 
codeword x$ = x\ = 1, the joint-product-state output, ^(1,2) T <g) |(1,2) T = 
^(1,2,2,4) T assigns, incorrectly, a non-zero probability to words 01 and 10. 

2.5 A Priori Initialisation 

To initialise xq to (a>o,(3o) T , we again use QPA. Let the diagonal of Uf (for 
the left-hand black box of Fig ||) be (ao,Po)- Then the diagonal of U g is 
_j_^ "oV 1 H a o| ^ /3 y/i-}/3o \_ y ^ Measurement of z = initialises xo to (ao,f3o) T , 

and this occurs with probability pj = 0.5. x% is initialised likewise. 

2.6 Comments 

The major drawback of QPA is the significant probability of QPA failure, oc- 
curring when z is measured as 1. This problem is amplified for larger Quantum 
Factor Graphs where a different z is measured at each local function; QPA 
evolution failure at a function node not only destroys the states of 
variables connected with that function, but also destroys all states of 
variables entangled with those variables. QPA is more likely to succeed 
when input variable probabilities are already skewed somewhat towards a valid 



S 



codeword. Section ^ shows how QPA can operate successfully even when SPA 
fails. 

3 Quantum Product Algorithm on Factor Graphs 
with Cycles 

This section shows that graph cycles do not compromise QPA performance. 
Consider the Factor Graph of Fig 




Figure 2: Factor Graph with a Cycle 

Functions Ufo and Ufi are both 8x8 XOR diagonal matrices with diago- 
nal elements (10010110). Acting on the combined four-qubit space, X0X1X2X3, 
they are the functions Ufo <8> I2 and 1 2 <8> Ufi, respectively, with diagonal ele- 
ments (1001011010010110) and (1100001100111100), respectively, where I 2 is 
the 2x2 identity matrix. QPA on Fig || performs the global function Uf = 
(Ufo 8) I 2 ) (I2 ® U fl ) on four-qubit space, with diagonal elements (1000001000010100), 
forcing output into codeset C = {0000,0110,1011,1101}. Functions U ro , U n , 
and Uf 'sieve' the input joint-state, where Uf is the combination of two 'sub- 
sieves', Ufo and Ufi. QPA iteration (i.e. successfully completing a sub-function 
more than once on the same qubits) has no purpose, as only one needs apply a 
particular sieve once. So graph cycles have no bearing on QPA. (However itera- 
tion may be useful to maintain the entangled result in the presence of quantum 
decoherence and noise). To underline cycle-independence, consider the action 
of SPA, then QPA on Fig g 
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Initialise as follows (using classical probabilities), 

x xix 2 x 3 = (0.1,0.9) T g> (0.6,0.4) T ® (0.6,0.4) T ® (0.6,0.4) T 

Hard-decision gives xqX\X2Xj, = 1000, which can then be decoded algebraically 
to codeword 0000. However optimal soft-decision would decode to either x^X2X\Xq = 
1011 or 1101, with equal probability. Because of the small graph cycle SPA fails 
to decode correctly, and settles to the joint-product-state, 
x xix 2 x 3 = (0.108, 0.892) T <g> (0.521, 0.479) T ® (0.521, 0.479) T <g> (0.601, 0.399) T . 
A final hard-decision on this output gives non-codeword XqXix^xs = 1000 which 
can then be decoded algebraically, again to codeword 0000. In contrast, suc- 
cessful QPA outputs the optimal entangled joint-state, 

S F = 1=(V^16, 0, 0, 0, 0, 0, ^96, 0, 0, 0, 0, ^864, 0, V864, 0, 0) T . Final mea- 
surement of Sf always outputs a codeword from C, and with probability 2 2 *q4q 4 
outputs either 1011 or 1101. QPA evolves on Fig || correctly with probabil- 
ity 0.204. Therefore 1000 attempts produce around 204 correctly entangled 
joint-states. 

To underline QPA advantage, consider the single variable extension of Fig 
^ in Fig where X4 is initialised to (\/0.5, \/0.5) T . 




Figure 3: Extended Factor Graph with a Cycle 

As 24 = £0 © £3, and our original code, C = {0000,0110,1011,1101}, 
always had x$ = 23, then X4 should always be 0. But SPA on Fig || computes 
24 = (0.421, 0.579) T and subsequent hard-decision gives £4 = 1. In contrast, 
successful QPA computes the optimal non-product joint-state, 
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Final measurement of Sp/ always outputs x\ = 0. QPA evolves on Fig |3| 
correctly with probability 0.204 * 0.5 = 0.114. 



4 Maximum-Likelihood (ML) Amplification 



4.1 Preliminaries 



The ML codeword is the one most likely to be measured from QPA output, 
with probability, pm, say. For instance, if QPA output of Fig |l] is Sf = 
^=(1,0,0,V2) T , say, then 11 is the ML codeword, and it is measured with 
probability pu = § • Numerous executions of QPA on the same input will verify 
that 11 is, indeed, the ML codeword. However these numerous executions must 
output to a length 2^ final averaging probability vector (for N qubits). We do 
not want to store such an exponential vector. Instead, therefore, we 'amplify' 
the statistical advantage of 11 over 00 prior to measurement, thereby making 11 
significantly more likely to be read. This is achieved by computing the square 
of each quantum vector element as follows. Consider two independent QPA ex- 
ecutions on the same input, both outputting Sf. Associate these outputs with 
qubits xo,o,^i,Oi and xo.ij^i,!- The joint-state of qubits ^0,0)^1,0)^0,1)^1,1 is, 



V = S f <g> S f = -(1, 0, 0, V2, 0, 0, 0, 0, 0, 0, 0, 0, A 0, 0, 2) T 
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Consider the unitary permutation matrix 

/ 1000000000000000 \ 
0000010000000000 
0000000000100000 
0000000000000001 
0100000000000000 
0010000000000000 
0001000000000000 
0000100000000000 

p = 

0000001000000000 
0000000100000000 
0000000010000000 
0000000001000000 
0000000000010000 
0000000000001000 
0000000000000100 
\ 0000000000000010 J 

Only the '1' positions in the first four rows are important. Performing P on 

zo,o, Zi,o,z ,i, a;i,i, gives, 

PV = 3(1, 0, 0, 2, 0, 0, V2, 0, 0, 0, 0, 0, 0, V2, 0, 0) T 

We then measure qubits XQ t \,x\ t \. With probability p ao = | we read xo,i = 
X\ t \ = 0, in which case xo,o and x\$ are forced into joint state S^i = 4^(1,0,0,2), 
which is the element-square of Sf. A measurement of Sf^ returns 11 with prob- 
ability pm = §, which is a significant improvement over pm = §• Likewise we 
compute the element fourth-powers of Sf by preparing two independent qubit 
pairs in Sf t i and permuting the (umeasured) joint state vector V\ = Sf : i ® 
to give PVi, and then measuring the second pair of qubits. With probability 
p ai = H we read this pair as 00, in which case the first two qubits are forced 
into the joint-state Sf^ = -^(1,0,0,4), which is the element fourth-power of 
Sf. A measurement of Sf^ returns 11 with probability pu = xf , which is a 
further improvement over pu = §■ In this way we amplify the likelihood of 
measuring the ML codeword. To compute the element 2 fc * n -power, Sf t k, we re- 
quire, on average, -p- independent preparations, Sf k _ l , each of which requires, 

^ a k 

on average, — - — independent preparations, St,_„, and so on. 

Pa k _ 1 ■ " 

We can perform QPA on large Factor Graphs, then amplify the result k times 
to ensure a high likelihood of measuring the ML codeword, as described above. 
However the above amplification acts on the complete graph with one operation, 
P. It would be preferable to decompose P into 4x4 unitary matrices which only 
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act on independent qubit pairs x^o and x^i, thereby localising amplification. 
Consider, once again, Fig [p. From the point of view of xq,i, £0,0 appears to be 
in summarised state , Sf = -^(1,\/2) T . Similarly, from the point of view of 
^0,0) ^0,1 appears to be in state sj. Thus a^o, ^o,i appear to be in joint product 
state vq = |(1, y/2, v2, 2) T . Consider unitary permutation matrix, 

/ 1000 \ 
0001 I 

Q = 

0100 I 
\ 0010 J 

We compute Qt>o = |(1, 2, y/2, y/2) T on qubits xo,o, xo,i and measure qubit xq,i- 
With probability p ao = | we read xq,i = 0, in which case xo,o is forced into joint 
state Sf^i = 77|(1> 2), which is the element-square of Sf. Due to the exact form 
of our joint-state vector, Sf, this single measurement is enough to also force 
#0,0^1,0 into joint state Sf t i. However, for a general function Sf, we should 
perform Q on every qubit pair, XifiX^x, then measure x^i Vi. This is equivalent 
to performing P' = Q ® Q on (re-ordered) joint-state vector £0,0^0,1^1,0^1,1, 
and this is identical to performing P on xo, 0^1, 0^0, 1^1,1 • The probability of 
measuring xiq = X\\ = is the same whether P or Q is used. The same 
process is followed to achieve element 2 fc th powers. 



4.2 The Price of Amplification 

There is a statistical cost to qubit amplification. Let s = (a, f3) T be the initial 
state of a qubit x, where, for notational convenience, we assume that a and (5 are 
both real. Then a 2 +(3 2 = 1 and, given 2 k qubits all identically prepared in state 
s, the likelihood of preparing one qubit in (unnormalised) state Sk = {a 2k ,j3 2h ) T 
is 7^, where, 

2 r k+i n 
7fc = 7fe-i— 2~, 70 = 1 
r k 

4 xo,o is generally not in this summarised state, due to phase considerations, but the view- 
point is valid for our purposes as long as subsequent unitary matrix operations on xO only 
have one non-zero entry per row. 
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and rk = a 2 ' + P 2 . For a qubit in state Sf., the probability of selecting the ML 
codebit is, 




o.5*< 1-* 1 1 1 * 1 1 1 1 *- a 2 = 0.5 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Figure 4: Amplification Success Probability, 7^, v ML Advantage, PMk 

Each of the 25 lines in Fig |] refers to a different value of a 2 , for a 2 from 
0.5 up to 0.98 in steps of 0.2. The initial state, s, when k = 0, occurs with 
probability 7^ = 1, and is marked on the right-hand side of Fig || for each of the 
25 lines. After one amplification step, k = 1, and another 25 points are marked 
on the graph to the left of the points for k = 0, indicating that a successful 
amplification step has occurred with probability 7jt < 1. Similarly points for 
k = 2, k = 3, ..etc are marked successively to the left on Fig |. The y-axis 
shows the ML advantage, Pmh, which can be achieved with probability 7^ after 
k steps for each value of a 2 . For instance, when s = (a, j3) T = (V0.62, ^0.38)^, 
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then an ML advantage of Puk = 0.9805 can be ensured after k = 3 steps, and 
this can be achieved with probability jf. = 0.0223 given 2 3 = 8 independently 
prepared qubits, all in state s. Amplification is more rapid if s already has 
significant ML advantage (i.e. when a is high). In contrast if a 2 = 0.5 then 
no amplification of that qubit is possible. This is quite reasonable as, in this 
case, both states and 1 are equally likely, so there is no ML state. Successive 
measurement of zero of all second qubits of each qubit pair self- verifies that we 
have obtained successful amplification. If, at any step, k, the second qubit of 
the qubit pair is measured as one then amplification fails and the graph local 
to this qubit which has been successfully entangled up until now is completely 
destroyed. 

5 Distributed QPA on Many-Node Factor Graphs 
5.1 Preliminaries 

In classical systems it is desirable to implement SPA on Factor Graphs which 
'tensor-approximate' the variable space using many small-state variables (e.g. 
bits), linked by small-dimensional constituent functions, thereby minimising 
computational complexity. In quantum systems it is similarly desirable to im- 
plement QPA on Factor Graphs using many small-state variables (e.g. qubits), 
linked by small-dimensional constituent unitary functions. Any Quantum Com- 
putation can be decomposed into a sequence of one or two-bit 'universal' gate 
unitary operations ||0 . Computational complexity is minimised by using small- 
dimensional unitary matrices for constituent functions. Moreover, fine granu- 
larity of the Factor Graph allows distributed node processing. This appears to 
be essential for large Quantum Factor Graphs to have acceptable probability of 
successful global evolution, as we will show. Distributed QPA allows variable 
nodes to evolve entanglement only with neighbouring variable nodes so that, if 
a local function measurement or amplification is unsuccessful, only local evo- 
lution is destroyed. Remember that local evolution is OFTEN unsuccessful, 

This also implies that any classical Factor Graph can be similarly decomposed. 
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as failure occurs when a local ancillary qubit, z, is measured as 1, or when a 
local amplifying qubit is measured as 1. Therefore node localities with high 
likelihood of successful evolution (i.e. with positively skewed input probabili- 
ties) are likely to evolve first. These will then encourage other self-contradictory 
node localities to evolve successfully. In contrast, non-distributed QPA on large 
Factor Graphs using one large global function is very unlikely to ever succeed, 
especially for graphs encoding low-rate codes. 




Figure 5: Distributed QPA (top) Non-Distributed QPA (bottom), 4-bit code 

To illustrate the advangtage of distributed QPA, consider the low rate code 
of Fig [|, where Ufy = diag(l,0,0, 1). Both top and bottom graphs represent 
the code C = {0000,1111}, where U is a combination of XOR sub-matrices, 
Ufoi,Uf!2, and Urn- The top graph distributes processing. We allow Ufoi 
and Uf23 to operate independently and in parallel. Moreover, if Ufoi fails to 
establish, then it does not destroy any successful evolution of Uf23, as the two 
localities are not currently entangled. Once both Ufoi and Uf23 have completed 
successfully, the subsequent probability of successful completion of Ufi2 is, in 
general, likely to increase. So distributing QPA increases likelihood of successful 
evolution of the complete Factor Graph. We now demonstrate this graphically. 
Let qubits 20,21,22,23 of Fig ||| initially be in states 20 = (ao,(3o) T , x\ = 
(ai,/?i) T , 22 = («2>/92) T j %3 = (ct3 5 /33) T , where, for notational convenience, we 
assume all values are real. Then af + = 1, Vz. The probability of successful 



16 



completion of Ufoi is p/oi = (ao«i) 2 + (A)/3i) 2 > and probability of successful 
completion of Uf23 is p/23 = (a2«3) 2 + (fyfts) 2 - Therefore the probability of 
successful completion of both Ufoi and Uf23 after exactly q parallel attempts 
(no less) is, 

Po _ 3 (<z) - (l-p/oO^l-prf-VoiPps+a-^ 

Given successful completion of Ufoi and Uf23, the probability of subse- 
quent successful completion of Ufi2 is, 

, _ (a aia 2 a 3 ) 2 + (PoPifofo) 2 
Pf 12 — 

Pf0lPf23 

Therefore the probability of successful completion of Ufoi and Uf23, immedi- 
ately followed by successful completion of Un.2 is, Po-»3(<z) = Pos(q — l)P/i 2 > 
and the probability of successful completion of Ufoi and Uf23, immediately 
followed by completion failure of Ufi2 is, pQ^(q) = pos(q — 1)(1 — p'f 12) ■ 
Therefore the probability of successful completion after exactly t steps of Ufoi 
and Uf23 in parallel, followed by Ufi2, is, 

* 

Pe{t) = J2P0^3(q) 11^3^) 
q=2 veD(t-q)«£v 

where D(k) is the set of unordered partitions of k. Therefore the probability of 
successful completion after at most t steps of Ufoi and Uf23 in parallel, followed 
by Ufi2, is, 

t 

Pm(t) = ^Pe(i) 
i=2 

In contrast, for non-distributed QPA, the probability of successful completion, 
after at most t steps, of U, (the bottom graph of Fig ||) is P(t) = 1 — (1 — 
(aoaia2a^) 2 — (AdA/^/^) 2 )*- Figs || and[7| show plots of p m {t) and P(t) versus 
t for ao = ol\ = «2 = «3 = w as w varies, and «o = u, <x\ = 02 = «3 = w = 0.9 
as u varies, respectively. For Fig fj], low values of u indicate a contradiction 
between xq and the other three variables. In particular the contradiction is so 
pronounced when ckq = 0-0 that successful QPA completion is highly unlikely. 
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More generally, this indicates that severe internal Factor Graph contradictions 
are fatal to QPA (as they are for SPA). Both Fig |6| and [7| indicate that, due to ini- 
tial latency of distributed processing, non-distributed QPA appears marginally 
faster for the first few steps. However, after a few steps distributed QPA in 
general becomes marginally faster. In fact results are unfairly biased towards 
the non-distributed case, as it is assumed that attempts to complete U and Ufy 
have the same space-time-complexity cost, whereas U is far more costly. Hence, 
even for this smallest example, Distributed QPA outperforms non-Distributed 
QPA. 




2 4 6 8 10 12 14 16 18 20 22 

t 

Figure 6: No of Steps v Non-Distributed and Distributed QPA: Completion 
Probabilities 

The example of Fig || only achieves marginal advantage using Distributed 
QPA because the example has so few nodes. The advantage is more pronounced 
in Fig ||. 

Fig | represents the code C = {000000000,111111111} g , where U ijk = 
6 This code is trivial but demonstrates a 'worst-case' low-rate scenario. In general, codes 

18 



2 4 6 8 10 12 14 16 18 20 22 

t 



Figure 7: No of Steps v Non-Distributed and Distributed QPA: w = 0.9, ao 
varies 

diag(l, 0, 0, 1, 0, 1, 1,0). We allow Ufoi2, Uf^s, and Uf678 to operate indepen- 
dently and in parallel. If Ufoi2 fails to establish, then it does not destroy any 
successful evolution of U045 or Uf 678 , as the three localities are not currently 
entangled. Once Ufoi2, Uf345, and Uf 678 have completed successfully, the 
probability of successful subsequent completion of Uf258 is, in general, ampli- 
fied. Let qubits xt, < i < 9 of Fig [|| initially be in states X{ = (a«, 0) T , where, 
for notational convenience, we assume all values are real. Then af + 01 = 1, 
Vi. Let the probability of successful completion after at most t steps of Ufoi2, 
UK45, and Uf678 in parallel, followed by Uf258) be p m (t), and the probability 
of successful completion, after at most t steps, of a non-distributed version of 
Fig H be P(t). Appendix A derives p m (t) and Pt for this case. Figs Q and 



1Q show plots of p m (t) and P(t) versus t for a.{ = w, Vi, as w varies, and 



ao = u, oci = w = 0.9, Vi, i / 0, as u varies, respectively. For Fig 10 low 



of higher rate, with or without cycles, decode more quickly. 
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Uqi2 






U258 


^ 







Figure 8: Distributed QPA, 9-qubits 

values of u indicate contradiction between xq and the other eight qubits. The 
contradiction is so pronounced when ao = 0.0 that successful QPA completion 
is highly unlikely. 

Figs [ll] and |l2| show plots of p m (t) and P(t) versus t for ao = ot\ = u, a.{ = 
w = 0.9, Vi, i 7^ 0, 1, and qo = as = u, aj = w = 0.9, Vi, i ^ 0,8, respectively, as 
u varies. Both figures indicate contradictions between two qubits and the rest, 
but the scattered nature of contradictions for Fig [l^ (xo and xs are connected 
to different local functions) enhances Distributed QPA performance compared 



to Fig 11 



Figs |6-12 indicate that distributed QPA completes significantly faster than 
non-distributed QPA, in particular for cases requiring many steps, t. Even 
more so as the presented results are unfairly biased towards the non-distributed 
case, as it is assumed that attempts to complete non-distributed U or each 
constituent Ufijk have the same space-time-complexity cost, whereas U is far 
more costly. We conclude that Distributed QPA is essential for large Quantum 
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Figure 9: No of Steps v Non-Distributed and Distributed QPA, 9 qubits 
Factor Graphs. 

5.2 Free-Running Distributed QPA 

Consider the notional Factor Graph of Fig [l3|. Each (square) function node 
activates time-independently on its local (circular) variable nodes. Functions 
successfully completed are marked with an 'X'. After a certain time, say, three 
'areas of success' evolve, due to general agreement between input variable states 
at these localities. This means that variables on the perimeter of each region 
of success are 'encouraged' to agree with the 'general view' of the associated 
region of success. Unfortunately, in the bottom left of the graph is a variable 
(dark circle) which strongly contradicts with the rest of the graph. No area 
of success evolves around it, and it is difficult for other areas of success to 
'swallow' it. Assuming the contradiction is not too strong then, eventually, 
after numerous attempts, the complete graph is marked with 'X's and the Graph 
evolves successfully. At this point the contents of each qubit variable can be 
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Figure 10: No of Steps v Non-Distributed and Distributed QPA: w 2 = 0.9, ao 
varies, 9 qubits 



amplified, and final measurement of all qubits provides the ML codeword with 
high probability. The advantage of a free-running strategy, where each function 
node is free to activate asynchronously, is that regions of general agreement 
develop first and influence other areas of the graph to 'follow their opinion'. Fig 
13 also shows that one 'bad' (contradictory) qubit can be a fatal stumbling block 
to successful evolution of the whole graph (as it can for SPA on classical graphs). 
Thus Distributed QPA requires Fault- Tolerance, where only an arbitrary subset 
of entangled nodes are required as a final result (node redundancy). The free- 
running schedule of Fig ^ naturally avoids the 'bad' qubits, but sufficient 
evolution occurs when enough function nodes complete. Alternatively, bad 
qubits could be set to (V0.5, V0.5) after a time-out. A more detailed proposal 
of Fault-Tolerant QPA is left for future work. 

Fig ^ also serves to illustrate the 'template' for a Reconfigurable Quantum 
Graph Array. One can envisage initialising an array of quantum variables so 
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Figure 11: No of Steps v Non-Distributed and Distributed QPA: w 2 = 0.9, 
ao = Oi\ varies, 9 qubits 

that two local variables can be strongly or weakly entangled by identifying the 
mutual square function nodes with strongly or weakly-entangling matrices, re- 
spectively. In particular, two neighbouring nodes may be 'locally disconnected' 
by setting the function node joining them to a tensor-decomposable matrix, 
(i.e. zero-entangling). The quantum computer is then measurement-driven. 
The concept of measurement-driven quantum computation has also recently 
been pursued in Q, where a uniform entanglement is set-up throughout the 
array ^\ prior to computation via measurement. 

Fig [l4] shows the system view of QPA. A continual stream of pure qubits 
needs to be initialised and then entangled, and then amplified, so as to ensure at 
least one successful entangled and amplified output from the whole apparatus. 



7 It is interesting that this entanglement is strongly related to Rudin-Shapiro and quadratic 
constructions Q 
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Figure 12: No of Steps v Non-Distributed and Distributed QPA: w 2 = 0.9, 
ao = «8 varies, 9 qubits 

6 Phase QPA 

The above discussions have ignored the capacity of Quantum Systems to carry 
phase information. In fact QPA, as presented so far, is immune to phase mod- 
ification, as classical probabilities have no phase component. However QPA 
should be generalised to cope with phase shift in order to decode quantum 
information. This is the subject of ongoing research. 

7 Conclusion and Discussion 

The Quantum Product Algorithm (QPA) on a Factor Graph has been pre- 
sented for Maximum-Likelihood (ML) Decoding of Classical 'soft' information 
using quantum resources. The relationship of QPA to the Sum-Product Al- 
gorithm (SPA) has been indicated, where avoidance of summary allows QPA 
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to overcome small graph cycles. Quantum Factor Graphs use small unitary 
matrices which each act on only a few qubits. QPA is measurement-driven 
and is only statistically likely to succeed after many attempts. The ML code- 
word is obtained with maximum likelihood by measuring the entangled vector 
resulting from successful QPA. To ensure a high probability of measuring the 
ML codeword QPA output can be amplified prior to measurement. The com- 
plete ML decoder is only successful after many attempts. Finally, free-running 
Distributed QPA is proposed to improve the likelihood of successful QPA com- 
pletion. The free-running distributed structure suggests further benefit will be 
obtained by introducing Fault-Tolerance in the form of redundant function and 
variable nodes. Phase aspects of QPA have yet to be explored. This paper has 
been written to demonstrate the exponential capacity of quantum systems, and 
their natural suitability for graph decompositions such as the Factor Graph. 
The paper has not tried to deal with quantum noise and quantum decoherence, 
but one can expect the Factor Graph form to 'gracefully' expand to cope with 
the extra redundancy necessary to protect qubits from decoherence and noise. 
When viewed in the context of entangled space, it is surprising how success- 
ful classical message-passing algorithms are, even though they are restricted to 
operate in tensor product space. This suggests that methods to improve the 
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Figure 14: QPA with Amplification 



likelihood of successful QPA completion may include the possibility of hybrid 
QPA/SPA graphs, where SPA operates on non-cyclic and resolvable parts of 
the graph, leaving QPA to cope with small cycles or unresolved areas of the 
graph. 



8 Appendix A - Deriving p m {t) and P(t) for Fig |8J 

The probability of successful completion of Ufo 12 , is p/012 = (cto^ia^) 2 + 
(P0P1P2) 2 , and similarly for p /345 and p f678 . Let h 012 = (1 -p/oi2) 9 ~\ ^345 = 
(1 — p/345) 9 " 1 , /i678 = (1 — P/67s) 9 • Then the probability of successful com- 
pletion of Ufoi2, Uf345, and Uf678 after exactly q parallel attempts is, 



P0-3-e(<?) = ^012^345^678P/012P/345P/678 + (1 ~ ^012)^345^678P/345P/678 
+ ^012(1 — ^345)^678P/012P/678 + ^012^345(1 — ^678)P/012P/345 

+ (1 - ftoia)(l - h 345 )h e78 p f67 $ + (1 - h Q12 )h 345 (l - h e78 )p f345 
+hou(l - h 3 4 5 )(l - h 67s )pfai2 

Given successful completion of Ufoi2, Uf345, and Uf678, the probability of 
subsequent successful completion of Uf258 is, 



P/258 



P/012P/345P678 
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Therefore the probability of successful completion of Ufoi2, Uf345, and Uf678, 
immediately followed by successful completion of Uf258 is, Po^s(q) = Po-3-6(<7 — 
l)Pj258> an< ^ * ne probability of successful completion of Ur)12, Uf345, and Uf678, 
immediately followed by completion failure of Uf258 is, Po^s{q) = Po-3~6(q — 
1)(1 — p'f25&)- The probability of successful completion after exactly t steps of 
Ufoi2, Uf34 5 , and U f678 in parallel, followed by Uf2 5 8, is, 

t 

Pe(t) = ^2P0^8(q) YiPO^Eiu) 
5=2 veD(t-q) «£v 

where D(k) is the set of unordered partitions of k. Therefore the probability 
of successful completion after at most t steps of Ufoi2, Uf345, and Uf678 in 
parallel, followed by Uf258, is, 

t 

Pm{t) = J2Pe(i) 
i=2 

In contrast, the probability of successful completion, after at most t steps, of a 
non-distributed version of Fig || is P(t) = 1 — (1 — (aoaia2Q.^a\a^aQa-ja%) 2 — 
(A/Ws&fls/Ws) 2 )*. 
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